Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 1381 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 70.0 KiB |
| Average record size in memory | 51.9 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 6 |
Empresa is highly correlated with Annual Turnover and 5 other fields | High correlation |
Annual Turnover is highly correlated with Empresa and 5 other fields | High correlation |
Employee Count is highly correlated with Empresa and 5 other fields | High correlation |
Activos Fijos is highly correlated with Empresa and 5 other fields | High correlation |
Aon Office_cat is highly correlated with AonOffice_cat and 1 other fields | High correlation |
Industry_cat is highly correlated with IndustryCode | High correlation |
TIER_cat is highly correlated with Empresa and 5 other fields | High correlation |
AonOffice_cat is highly correlated with Aon Office_cat and 1 other fields | High correlation |
AonOfficeCode is highly correlated with Aon Office_cat and 1 other fields | High correlation |
IndustryCode is highly correlated with Industry_cat | High correlation |
TIERCode is highly correlated with Empresa and 5 other fields | High correlation |
TIERcode is highly correlated with Empresa and 5 other fields | High correlation |
Empresa is highly correlated with TIER_cat and 2 other fields | High correlation |
Annual Turnover is highly correlated with Activos Fijos | High correlation |
Activos Fijos is highly correlated with Annual Turnover | High correlation |
Aon Office_cat is highly correlated with AonOffice_cat and 1 other fields | High correlation |
Industry_cat is highly correlated with IndustryCode | High correlation |
TIER_cat is highly correlated with Empresa and 2 other fields | High correlation |
AonOffice_cat is highly correlated with Aon Office_cat and 1 other fields | High correlation |
AonOfficeCode is highly correlated with Aon Office_cat and 1 other fields | High correlation |
IndustryCode is highly correlated with Industry_cat | High correlation |
TIERCode is highly correlated with Empresa and 2 other fields | High correlation |
TIERcode is highly correlated with Empresa and 2 other fields | High correlation |
Empresa is highly correlated with Annual Turnover | High correlation |
Annual Turnover is highly correlated with Empresa and 4 other fields | High correlation |
Employee Count is highly correlated with TIER_cat and 2 other fields | High correlation |
Activos Fijos is highly correlated with Annual Turnover and 3 other fields | High correlation |
Industry_cat is highly correlated with IndustryCode | High correlation |
TIER_cat is highly correlated with Annual Turnover and 2 other fields | High correlation |
IndustryCode is highly correlated with Industry_cat | High correlation |
TIERCode is highly correlated with Annual Turnover and 2 other fields | High correlation |
TIERcode is highly correlated with Annual Turnover and 2 other fields | High correlation |
TIER_cat is highly correlated with TIERCode and 2 other fields | High correlation |
TIERCode is highly correlated with TIER_cat and 2 other fields | High correlation |
TIER GENERAL is highly correlated with TIER_cat and 2 other fields | High correlation |
TIERcode is highly correlated with TIER_cat and 2 other fields | High correlation |
Empresa is highly correlated with TIER GENERAL and 3 other fields | High correlation |
Annual Turnover is highly correlated with Employee Count and 1 other fields | High correlation |
Employee Count is highly correlated with Annual Turnover | High correlation |
Activos Fijos is highly correlated with Annual Turnover | High correlation |
Aon Office is highly correlated with Aon Office_cat and 2 other fields | High correlation |
Industry is highly correlated with Industry_cat and 1 other fields | High correlation |
TIER GENERAL is highly correlated with Empresa and 3 other fields | High correlation |
Aon Office_cat is highly correlated with Aon Office and 2 other fields | High correlation |
Industry_cat is highly correlated with Industry and 1 other fields | High correlation |
TIER_cat is highly correlated with Empresa and 3 other fields | High correlation |
AonOffice_cat is highly correlated with Aon Office and 2 other fields | High correlation |
AonOfficeCode is highly correlated with Aon Office and 2 other fields | High correlation |
IndustryCode is highly correlated with Industry and 1 other fields | High correlation |
TIERCode is highly correlated with Empresa and 3 other fields | High correlation |
TIERcode is highly correlated with Empresa and 3 other fields | High correlation |
Activos Fijos is highly skewed (γ1 = 24.57726945) | Skewed |
Empresa has unique values | Unique |
Annual Turnover has unique values | Unique |
Activos Fijos has 44 (3.2%) zeros | Zeros |
Produccion has 14 (1.0%) zeros | Zeros |
Aon Office_cat has 77 (5.6%) zeros | Zeros |
Industry_cat has 15 (1.1%) zeros | Zeros |
AonOffice_cat has 77 (5.6%) zeros | Zeros |
AonOfficeCode has 77 (5.6%) zeros | Zeros |
IndustryCode has 15 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-10 00:10:06.312526 |
|---|---|
| Analysis finished | 2022-09-10 00:11:12.449384 |
| Duration | 1 minute and 6.14 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1381 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 726.0246198 |
| Minimum | 1 |
|---|---|
| Maximum | 1753 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 70 |
| Q1 | 357 |
| median | 719 |
| Q3 | 1088 |
| 95-th percentile | 1410 |
| Maximum | 1753 |
| Range | 1752 |
| Interquartile range (IQR) | 731 |
Descriptive statistics
| Standard deviation | 429.3227454 |
|---|---|
| Coefficient of variation (CV) | 0.5913335907 |
| Kurtosis | -1.153345119 |
| Mean | 726.0246198 |
| Median Absolute Deviation (MAD) | 366 |
| Skewness | 0.06073991387 |
| Sum | 1002640 |
| Variance | 184318.0197 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 132 | 1 | 0.1% |
| 219 | 1 | 0.1% |
| 657 | 1 | 0.1% |
| 1029 | 1 | 0.1% |
| 989 | 1 | 0.1% |
| 557 | 1 | 0.1% |
| 570 | 1 | 0.1% |
| 954 | 1 | 0.1% |
| 523 | 1 | 0.1% |
| 120 | 1 | 0.1% |
| Other values (1371) | 1371 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 1753 | 1 | |
| 1751 | 1 | |
| 1511 | 1 | |
| 1496 | 1 | |
| 1495 | 1 | |
| 1494 | 1 | |
| 1493 | 1 | |
| 1492 | 1 | |
| 1491 | 1 | |
| 1490 | 1 |
Annual Turnover
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 1381 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.171250896 × 1011 |
| Minimum | 17990 |
|---|---|
| Maximum | 6.2615849 × 1013 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.9 KiB |
Quantile statistics
| Minimum | 17990 |
|---|---|
| 5-th percentile | 1130458656 |
| Q1 | 1.7007349 × 1010 |
| median | 6.3554511 × 1010 |
| Q3 | 2.120179646 × 1011 |
| 95-th percentile | 1.273922 × 1012 |
| Maximum | 6.2615849 × 1013 |
| Range | 6.261584898 × 1013 |
| Interquartile range (IQR) | 1.950106156 × 1011 |
Descriptive statistics
| Standard deviation | 2.38142605 × 1012 |
|---|---|
| Coefficient of variation (CV) | 5.709141238 |
| Kurtosis | 425.7200292 |
| Mean | 4.171250896 × 1011 |
| Median Absolute Deviation (MAD) | 5.6658937 × 1010 |
| Skewness | 18.64226381 |
| Sum | 5.760497487 × 1014 |
| Variance | 5.671190033 × 1024 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.792457353 × 1011 | 1 | 0.1% |
| 3.94900026 × 1011 | 1 | 0.1% |
| 1.082385079 × 1011 | 1 | 0.1% |
| 2.0658045 × 1010 | 1 | 0.1% |
| 3.2217318 × 1010 | 1 | 0.1% |
| 3.46582179 × 1011 | 1 | 0.1% |
| 2.870114518 × 1011 | 1 | 0.1% |
| 7.525665158 × 1010 | 1 | 0.1% |
| 5.912305 × 1010 | 1 | 0.1% |
| 9.369707892 × 1011 | 1 | 0.1% |
| Other values (1371) | 1371 |
| Value | Count | Frequency (%) |
| 17990 | 1 | |
| 443708 | 1 | |
| 1000000 | 1 | |
| 9624196 | 1 | |
| 12527241 | 1 | |
| 14207000 | 1 | |
| 25134481 | 1 | |
| 26093137 | 1 | |
| 35377708 | 1 | |
| 45296000 | 1 |
| Value | Count | Frequency (%) |
| 6.2615849 × 1013 | 1 | |
| 4.401029861 × 1013 | 1 | |
| 2.125358533 × 1013 | 1 | |
| 1.754272879 × 1013 | 1 | |
| 1.473396004 × 1013 | 1 | |
| 1.1021135 × 1013 | 1 | |
| 1.051657116 × 1013 | 1 | |
| 8.98888574 × 1012 | 1 | |
| 8.667597705 × 1012 | 1 | |
| 8.417604 × 1012 | 1 |
| Distinct | 731 |
|---|---|
| Distinct (%) | 52.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 842.4221579 |
| Minimum | 1 |
|---|---|
| Maximum | 38622 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 60 |
| median | 210 |
| Q3 | 660 |
| 95-th percentile | 3477 |
| Maximum | 38622 |
| Range | 38621 |
| Interquartile range (IQR) | 600 |
Descriptive statistics
| Standard deviation | 2131.436433 |
|---|---|
| Coefficient of variation (CV) | 2.530128645 |
| Kurtosis | 93.81771377 |
| Mean | 842.4221579 |
| Median Absolute Deviation (MAD) | 188 |
| Skewness | 7.670661804 |
| Sum | 1163385 |
| Variance | 4543021.267 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 30 | 2.2% |
| 2 | 15 | 1.1% |
| 3 | 13 | 0.9% |
| 5 | 13 | 0.9% |
| 4 | 13 | 0.9% |
| 10 | 12 | 0.9% |
| 100 | 10 | 0.7% |
| 45 | 9 | 0.7% |
| 6 | 9 | 0.7% |
| 53 | 9 | 0.7% |
| Other values (721) | 1248 |
| Value | Count | Frequency (%) |
| 1 | 30 | |
| 2 | 15 | |
| 3 | 13 | |
| 4 | 13 | |
| 5 | 13 | |
| 6 | 9 | 0.7% |
| 7 | 7 | 0.5% |
| 8 | 7 | 0.5% |
| 9 | 4 | 0.3% |
| 10 | 12 | 0.9% |
| Value | Count | Frequency (%) |
| 38622 | 1 | |
| 23000 | 1 | |
| 20469 | 1 | |
| 17089 | 1 | |
| 14570 | 1 | |
| 13421 | 1 | |
| 13000 | 1 | |
| 12784 | 1 | |
| 12352 | 1 | |
| 11433 | 1 |
Activos Fijos
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 1338 |
|---|---|
| Distinct (%) | 96.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.183332675 × 1011 |
| Minimum | 0 |
|---|---|
| Maximum | 1.07708124 × 1014 |
| Zeros | 44 |
| Zeros (%) | 3.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18346000 |
| Q1 | 3007724000 |
| median | 1.7937413 × 1010 |
| Q3 | 9.683993792 × 1010 |
| 95-th percentile | 1.191901918 × 1012 |
| Maximum | 1.07708124 × 1014 |
| Range | 1.07708124 × 1014 |
| Interquartile range (IQR) | 9.383221392 × 1010 |
Descriptive statistics
| Standard deviation | 3.409042978 × 1012 |
|---|---|
| Coefficient of variation (CV) | 8.149108002 |
| Kurtosis | 730.5028182 |
| Mean | 4.183332675 × 1011 |
| Median Absolute Deviation (MAD) | 1.7615085 × 1010 |
| Skewness | 24.57726945 |
| Sum | 5.777182424 × 1014 |
| Variance | 1.162157402 × 1025 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44 | 3.2% |
| 7.169544316 × 1011 | 1 | 0.1% |
| 1.6697271 × 1010 | 1 | 0.1% |
| 1.882094512 × 1011 | 1 | 0.1% |
| 1.55887579 × 1011 | 1 | 0.1% |
| 1.8778156 × 1010 | 1 | 0.1% |
| 4146663139 | 1 | 0.1% |
| 2.865878542 × 1010 | 1 | 0.1% |
| 3.321499628 × 1010 | 1 | 0.1% |
| 76049000 | 1 | 0.1% |
| Other values (1328) | 1328 |
| Value | Count | Frequency (%) |
| 0 | 44 | |
| 2000 | 1 | 0.1% |
| 548000 | 1 | 0.1% |
| 1198621 | 1 | 0.1% |
| 1401750 | 1 | 0.1% |
| 1468166 | 1 | 0.1% |
| 1936936 | 1 | 0.1% |
| 2209272 | 1 | 0.1% |
| 3044000 | 1 | 0.1% |
| 4060000 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1.07708124 × 1014 | 1 | |
| 4.257372232 × 1013 | 1 | |
| 1.8116425 × 1013 | 1 | |
| 1.7709473 × 1013 | 1 | |
| 1.759444784 × 1013 | 1 | |
| 1.5981658 × 1013 | 1 | |
| 1.341628877 × 1013 | 1 | |
| 1.309553558 × 1013 | 1 | |
| 1.1748621 × 1013 | 1 | |
| 1.104848718 × 1013 | 1 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| Bogota | |
|---|---|
| Cali | |
| Medellin | |
| Barranquilla | 77 |
| TBD | 32 |
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 6.325850833 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bogota |
|---|---|
| 2nd row | Bogota |
| 3rd row | Bogota |
| 4th row | Bogota |
| 5th row | Cali |
Common Values
| Value | Count | Frequency (%) |
| Bogota | 848 | |
| Cali | 213 | 15.4% |
| Medellin | 189 | 13.7% |
| Barranquilla | 77 | 5.6% |
| TBD | 32 | 2.3% |
| TBD Colombia | 22 | 1.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| bogota | 848 | |
| cali | 213 | 15.2% |
| medellin | 189 | 13.5% |
| barranquilla | 77 | 5.5% |
| tbd | 54 | 3.8% |
| colombia | 22 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 19 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 KiB |
| Retail and Wholesale Trade | |
|---|---|
| Food System, Agribusiness and Beverage | |
| Construction Services | |
| Manufacturing | |
| Energy | |
| Other values (14) |
Length
| Max length | 38 |
|---|---|
| Median length | 22 |
| Mean length | 22.82621289 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Food System, Agribusiness and Beverage |
|---|---|
| 2nd row | Business and Personal Services |
| 3rd row | Transportation and Logistics |
| 4th row | Energy |
| 5th row | Pharmaceutical and Chemicals |
Common Values
| Value | Count | Frequency (%) |
| Retail and Wholesale Trade | 165 | |
| Food System, Agribusiness and Beverage | 147 | |
| Construction Services | 125 | |
| Manufacturing | 112 | |
| Energy | 111 | |
| Business and Personal Services | 107 | |
| Professional Services | 101 | |
| Financial Institutions | 97 | 7.0% |
| Technology and Communications | 94 | 6.8% |
| Pharmaceutical and Chemicals | 77 | 5.6% |
| Other values (9) | 245 |
Length
| Value | Count | Frequency (%) |
| and | 678 | |
| services | 394 | 10.4% |
| retail | 165 | 4.3% |
| wholesale | 165 | 4.3% |
| trade | 165 | 4.3% |
| food | 147 | 3.9% |
| system | 147 | 3.9% |
| agribusiness | 147 | 3.9% |
| beverage | 147 | 3.9% |
| construction | 125 | 3.3% |
| Other values (26) | 1514 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| TIER 4 | |
|---|---|
| TIER 3 | |
| TIER 1 | |
| TIER 2 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TIER 1 |
|---|---|
| 2nd row | TIER 3 |
| 3rd row | TIER 3 |
| 4th row | TIER 1 |
| 5th row | TIER 2 |
Common Values
| Value | Count | Frequency (%) |
| TIER 4 | 689 | |
| TIER 3 | 293 | |
| TIER 1 | 235 | 17.0% |
| TIER 2 | 164 | 11.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| tier | 1381 | |
| 4 | 689 | |
| 3 | 293 | 10.6% |
| 1 | 235 | 8.5% |
| 2 | 164 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1366 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5167595.525 |
| Minimum | -7258043.856 |
|---|---|
| Maximum | 697114645.7 |
| Zeros | 14 |
| Zeros (%) | 1.0% |
| Negative | 23 |
| Negative (%) | 1.7% |
| Memory size | 10.9 KiB |
Quantile statistics
| Minimum | -7258043.856 |
|---|---|
| 5-th percentile | 3061.22449 |
| Q1 | 172685.4286 |
| median | 943433.7551 |
| Q3 | 3197744.153 |
| 95-th percentile | 19100482.96 |
| Maximum | 697114645.7 |
| Range | 704372689.6 |
| Interquartile range (IQR) | 3025058.724 |
Descriptive statistics
| Standard deviation | 24070541.94 |
|---|---|
| Coefficient of variation (CV) | 4.657977163 |
| Kurtosis | 510.7242938 |
| Mean | 5167595.525 |
| Median Absolute Deviation (MAD) | 906459.3232 |
| Skewness | 19.49226026 |
| Sum | 7136449420 |
| Variance | 5.793909894 × 1014 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 0 | 14 | 1.0% |
| 3061.22449 | 2 | 0.1% |
| 1224.489796 | 2 | 0.1% |
| 2110064.585 | 1 | 0.1% |
| 2167778.699 | 1 | 0.1% |
| 2167145.35 | 1 | 0.1% |
| 2157843.214 | 1 | 0.1% |
| 2152662.295 | 1 | 0.1% |
| 2150200.776 | 1 | 0.1% |
| 2147742.644 | 1 | 0.1% |
| Other values (1356) | 1356 |
| Value | Count | Frequency (%) |
| -7258043.856 | 1 | |
| -4130887.936 | 1 | |
| -1822678.461 | 1 | |
| -1288362.122 | 1 | |
| -1242213.71 | 1 | |
| -1240228.357 | 1 | |
| -378463.2092 | 1 | |
| -358089.8572 | 1 | |
| -287446.0714 | 1 | |
| -249194.8571 | 1 |
| Value | Count | Frequency (%) |
| 697114645.7 | 1 | |
| 217903786.2 | 1 | |
| 213689703.1 | 1 | |
| 196827794 | 1 | |
| 187723942.5 | 1 | |
| 175177926.3 | 1 | |
| 102263065.8 | 1 | |
| 102005422 | 1 | |
| 97818904.61 | 1 | |
| 91935266.42 | 1 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.505430847 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 77 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9829006425 |
|---|---|
| Coefficient of variation (CV) | 0.6529032166 |
| Kurtosis | 1.614981294 |
| Mean | 1.505430847 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.314584933 |
| Sum | 2079 |
| Variance | 0.9660936729 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 0 | 77 | 5.6% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 77 | 5.6% |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 5 | 22 | 1.6% |
| 4 | 32 | 2.3% |
| 3 | 189 | 13.7% |
| 2 | 213 | 15.4% |
| 1 | 848 | |
| 0 | 77 | 5.6% |
Industry_cat
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 19 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.715423606 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 15 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 14 |
| 95-th percentile | 17 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 5.671129624 |
|---|---|
| Coefficient of variation (CV) | 0.6507003997 |
| Kurtosis | -1.354557831 |
| Mean | 8.715423606 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.2289347205 |
| Sum | 12036 |
| Variance | 32.16171122 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 165 | |
| 6 | 147 | |
| 2 | 125 | |
| 8 | 112 | |
| 3 | 111 | |
| 1 | 107 | |
| 13 | 101 | |
| 5 | 97 | 7.0% |
| 17 | 94 | 6.8% |
| 11 | 77 | 5.6% |
| Other values (9) | 245 |
| Value | Count | Frequency (%) |
| 0 | 15 | 1.1% |
| 1 | 107 | |
| 2 | 125 | |
| 3 | 111 | |
| 4 | 22 | 1.6% |
| 5 | 97 | |
| 6 | 147 | |
| 7 | 61 | |
| 8 | 112 | |
| 9 | 14 | 1.0% |
| Value | Count | Frequency (%) |
| 18 | 66 | 4.8% |
| 17 | 94 | |
| 16 | 165 | |
| 15 | 12 | 0.9% |
| 14 | 21 | 1.5% |
| 13 | 101 | |
| 12 | 21 | 1.5% |
| 11 | 77 | |
| 10 | 13 | 0.9% |
| 9 | 14 | 1.0% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.9 KiB |
| 3 | |
|---|---|
| 2 | |
| 0 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.505430847 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 77 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9829006425 |
|---|---|
| Coefficient of variation (CV) | 0.6529032166 |
| Kurtosis | 1.614981294 |
| Mean | 1.505430847 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.314584933 |
| Sum | 2079 |
| Variance | 0.9660936729 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 0 | 77 | 5.6% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 77 | 5.6% |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 5 | 22 | 1.6% |
| 4 | 32 | 2.3% |
| 3 | 189 | 13.7% |
| 2 | 213 | 15.4% |
| 1 | 848 | |
| 0 | 77 | 5.6% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.505430847 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 77 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9829006425 |
|---|---|
| Coefficient of variation (CV) | 0.6529032166 |
| Kurtosis | 1.614981294 |
| Mean | 1.505430847 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.314584933 |
| Sum | 2079 |
| Variance | 0.9660936729 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 0 | 77 | 5.6% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 77 | 5.6% |
| 1 | 848 | |
| 2 | 213 | 15.4% |
| 3 | 189 | 13.7% |
| 4 | 32 | 2.3% |
| 5 | 22 | 1.6% |
| Value | Count | Frequency (%) |
| 5 | 22 | 1.6% |
| 4 | 32 | 2.3% |
| 3 | 189 | 13.7% |
| 2 | 213 | 15.4% |
| 1 | 848 | |
| 0 | 77 | 5.6% |
IndustryCode
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 19 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.715423606 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 15 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 14 |
| 95-th percentile | 17 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 5.671129624 |
|---|---|
| Coefficient of variation (CV) | 0.6507003997 |
| Kurtosis | -1.354557831 |
| Mean | 8.715423606 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.2289347205 |
| Sum | 12036 |
| Variance | 32.16171122 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 165 | |
| 6 | 147 | |
| 2 | 125 | |
| 8 | 112 | |
| 3 | 111 | |
| 1 | 107 | |
| 13 | 101 | |
| 5 | 97 | 7.0% |
| 17 | 94 | 6.8% |
| 11 | 77 | 5.6% |
| Other values (9) | 245 |
| Value | Count | Frequency (%) |
| 0 | 15 | 1.1% |
| 1 | 107 | |
| 2 | 125 | |
| 3 | 111 | |
| 4 | 22 | 1.6% |
| 5 | 97 | |
| 6 | 147 | |
| 7 | 61 | |
| 8 | 112 | |
| 9 | 14 | 1.0% |
| Value | Count | Frequency (%) |
| 18 | 66 | 4.8% |
| 17 | 94 | |
| 16 | 165 | |
| 15 | 12 | 0.9% |
| 14 | 21 | 1.5% |
| 13 | 101 | |
| 12 | 21 | 1.5% |
| 11 | 77 | |
| 10 | 13 | 0.9% |
| 9 | 14 | 1.0% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.9 KiB |
| 3 | |
|---|---|
| 2 | |
| 0 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.9 KiB |
| 3 | |
|---|---|
| 2 | |
| 0 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 689 | |
| 2 | 293 | |
| 0 | 235 | 17.0% |
| 1 | 164 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Empresa | Annual Turnover | Employee Count | Activos Fijos | Aon Office | Industry | TIER GENERAL | Produccion | Aon Office_cat | Industry_cat | TIER_cat | AonOffice_cat | AonOfficeCode | IndustryCode | TIERCode | TIERcode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 132 | 579245735319 | 3038 | 716954431572 | Bogota | Food System, Agribusiness and Beverage | TIER 1 | -7258043.85604 | 1 | 6 | 0 | 1 | 1 | 6 | 0 | 0 |
| 1 | 1061 | 67176109000 | 975 | 38508144000 | Bogota | Business and Personal Services | TIER 3 | -4130887.93592 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 2 |
| 2 | 622 | 154255168000 | 1587 | 87866173000 | Bogota | Transportation and Logistics | TIER 3 | -1822678.46122 | 1 | 18 | 2 | 1 | 1 | 18 | 2 | 2 |
| 3 | 173 | 1271848954000 | 157 | 462891146000 | Bogota | Energy | TIER 1 | -1288362.12245 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |
| 4 | 144 | 433259253705 | 558 | 129907322353 | Cali | Pharmaceutical and Chemicals | TIER 2 | -1242213.71020 | 2 | 11 | 1 | 2 | 2 | 11 | 1 | 1 |
| 5 | 1318 | 4215551465 | 2 | 8300039261 | Medellin | Power | TIER 4 | -1240228.35714 | 3 | 12 | 3 | 3 | 3 | 12 | 3 | 3 |
| 6 | 111 | 2490236912000 | 898 | 450914685000 | Bogota | Manufacturing | TIER 1 | -378463.20918 | 1 | 8 | 0 | 1 | 1 | 8 | 0 | 0 |
| 7 | 61 | 1670917995000 | 5 | 3144028890000 | Bogota | Energy | TIER 1 | -358089.85718 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |
| 8 | 304 | 845609000000 | 94 | 17709473000000 | Medellin | Manufacturing | TIER 1 | -287446.07143 | 3 | 8 | 0 | 3 | 3 | 8 | 0 | 0 |
| 9 | 567 | 307944030000 | 123 | 19417683000 | Bogota | Retail and Wholesale Trade | TIER 2 | -249194.85714 | 1 | 16 | 1 | 1 | 1 | 16 | 1 | 1 |
Last rows
| Empresa | Annual Turnover | Employee Count | Activos Fijos | Aon Office | Industry | TIER GENERAL | Produccion | Aon Office_cat | Industry_cat | TIER_cat | AonOffice_cat | AonOfficeCode | IndustryCode | TIERCode | TIERcode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1371 | 47 | 4535456052999 | 2470 | 6555499649859 | Bogota | Food System, Agribusiness and Beverage | TIER 1 | 91935266.41633 | 1 | 6 | 0 | 1 | 1 | 6 | 0 | 0 |
| 1372 | 45 | 4856876387000 | 555 | 4120829242000 | Barranquilla | Power | TIER 1 | 97818904.61350 | 0 | 12 | 0 | 0 | 0 | 12 | 0 | 0 |
| 1373 | 31 | 687744946352 | 1244 | 143590594585 | Bogota | Energy | TIER 1 | 102005422.04305 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |
| 1374 | 30 | 748710891000 | 1411 | 184115035000 | Bogota | Energy | TIER 1 | 102263065.75811 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |
| 1375 | 20 | 2107769994850 | 1521 | 412903928190 | Bogota | Financial Institutions | TIER 1 | 175177926.27857 | 1 | 5 | 0 | 1 | 1 | 5 | 0 | 0 |
| 1376 | 10 | 5225686577000 | 6223 | 4931788938000 | Bogota | Energy | TIER 1 | 187723942.53265 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |
| 1377 | 5 | 8667597705000 | 10824 | 13416288766000 | Barranquilla | Aviation | TIER 1 | 196827793.98032 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1378 | 2 | 44010298607360 | 20469 | 5017659967440 | Medellin | Financial Institutions | TIER 1 | 213689703.14898 | 3 | 5 | 0 | 3 | 3 | 5 | 0 | 0 |
| 1379 | 7 | 2977918775740 | 7866 | 379604155600 | Bogota | Financial Institutions | TIER 1 | 217903786.17753 | 1 | 5 | 0 | 1 | 1 | 5 | 0 | 0 |
| 1380 | 9 | 6061683220000 | 5224 | 6363713096000 | Bogota | Energy | TIER 1 | 697114645.73375 | 1 | 3 | 0 | 1 | 1 | 3 | 0 | 0 |